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ABSTRACT 

The depth and range of the corpus of written Chinese 
symbols used by 9-year-old Hong Kong children were studied in order 
to compile a sinimum written Chinese vocabulary list. Error patterns 
of children were also analyzed. After presenting some characteristics 
of the language and reviewing previous studies, the methodology was 
described. The sample size ranged from 407 to 461 children who were 
matched for age, sex, and academic performance. The 1,851 different 
symbols or characters compiled from survey returns were incorporated 
into 30 prose passages of 100 characters each (with about 40 percent 
recurring characters) and recorded on tape. Taped explanations of the 
passage content preceded three repetitions of the passage during 
which the children listened, transcribed, and reviewed. From the 
accumulation of correct and incorrect responses, a facility index was 
derived for each of the 1,851 characters. It was concluded (1) that 
719 characters were reproduced by at least 70 percent of the children 
and should be regarded as the minimus core of written vocabulary at 
this level end (2) that the mean number of strokes for characters of 
median difficulty was 14. Further teseatch in auditory reception of 
language and encoding of the symbols received is recommended. 
References are included. *CL) 



946 EDO 44245 



Che Kan Leong 
Assistant Professor of 
Educat Ion, 

Special Education Depeitmen 
University of Saskatchewan, 
Saskatoon, Saskatchewan, 
Canada . 



U ft DIMM MINI OF M1AITK. IOVC ATlON 
ft WIUAM 

omci or KOUCATIOM 

fMil OOCWIN7 HAft ItIN MfftODUCfD 
IXACHV A| ftECIIYID MOW TM I Pi MO H Oft 
0A6AM2ATI0N OftlO»NATlVO fT fOiNTft Of 
VltW Oft OftfOON* MATEO 00 KO' NEC t* 
ftAftilt ftl f fttllNT Of FKlAl Off ICE Of EOCI 
CATION WftlTlO* Oft ftOuCf 



INTERNATIONAL READING ASSOCIATION 
FIFTEENTH ANNUAL CONVENTION - MAY, 1970 
Readtnft at the International Level * Friday, 8th May i 1970 at 4.00 p>m» 



n An Experimental Study of the Vocabulary of Written Chinese amons 
Primary III Children In Hong Kon r " 



Oi 

o 

© 



This paper reports the results of the first stage of a longitudinal 
project which aims at studying the minimum vocabulary of written Chinese 
of primary school children In Hong Kong. The knowledge of this aspect of 
the language calls for the active ability to reproduce written symbols 
as contrasted with the more passive command of the reading vocabulary. 

In this study the main aim was to investigate both the range and depth of 
thr corpus of written Chinese symbols or characters used by children In 
the third year of primary schooling with a view to compiling a list of 
minimum written Chinese vocabulary. A subsidiary aim was to analyse 
error patterns of the children for more effective teaching and for the 
construction of diagnostic tests. 
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At the outset It was decided that the materials collected must be 
based on the language repertoire of the children and that teachers would 
be actively Involved In seminars, field-work, marking of scripts and feed- 
back of Information to pupils so as to maximise the effects of the 
experiment . 

II Some Characteristics of the Chinese Language 

Generally, the Chinese language can be classified as monosyllabic, 
lslating and analytic. 

Chinese can be said to be monosyllabic as each character Is a 
phonemic element or a syllable. But this does not apply to words which 
may contain more *-.han one phonemic element. An example Is the equivalent 
of the word "madam" ( Kk ) where each character by Itself means 
"husband" ( K> and "person" (A,). Chinese does not have, as certain 
other Languages, disyllabic or polysyllabic stem-words. 

Chinese can be said to b^ Isolating In contrast to inflexional 
Languages where there Is a merging of semantically distinct features 
either In o single bound form or In closely united bound forms or to 
agglutinative Languages where bound forms follow one another. The char- 
acter (AO means equally "man" "man's", "men", "men's" "the man", 

"the man's", "the men" and "the r.en's". The use of classifiers surmounts 
the difficulty. 

Chinese Is better described as analytical where each word is a one- 
syllable morpheme or a compound word or phrase-word and where there Are 
few bound forms. This Is In contrast to synthetical Languages which use 
many bound forms. 

In addition, Chinese Is distinguished by the use of pitch as primary 
phoneme, northern Chinese differentiates four of thesei a) high level, 
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e.g. "mother" ah ) pronounced as /raa/; b) high rising - "hemp" (j^) 
os /taa/; c) low rising - "horse" (*$)) as /ma/ and d) low falling - "scold" 
(J^j ) as /ma/. Southern Chinese further adds two tones, vis., the high and 
the low tones for each of the above pitch levels and an intermediate tone 
for the low falling pitch as well, making a total of nine tones. The bases 
of all these tonal differences are: a) duration * the relative length of 

time through which the vocal organs ore kept In a position and b) stress - 
Intensity or loudness. 

From the foregoing It can be seen that there Is a large number of 
homonyms and homophones. Thus the character with the short vowel sound 
"i" con mean something like thirty-eight different words. 

III. Re view of Previous Studie s 

Very broadly, vocabulary studies fall Into three categories. One 
group of researchers such as Burroughs (1957) studies the oral vocabulary 
of children. The second group such as Rlnsland (1947) studies the work 
written by children. The third group studies children's works, Thorndike's 
word count (1944) being a classic example. 

In the case of the Chinese language Chuang (1938) compiled a "funda- 
mental vocabulary" of 5262 characters. The list Is divided into characters 
most frequently used with about 2800 characters; those less frequently 
used and those rarely used with about 1200 characters in each of these 
Latter two categories. More recently, Kennedy (1954) edited the minimum 
voca lularles of written Chinese. The List of 1020 characters Includes 
those which are used at Yale University for checking purposes. The List 
of 2421 characters Is the official List for adult education In mainland 
China. This is further broken down Into! a) 1010 most frequently used 
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characters forming with another 490 characters a limiting list of 1500 
characters; b) a supplementary list of 500 characters and c) a list of 
technical terms of 421 characters. 

While the above two studies are useful they would not be entirely 
applicable to the present study. Chuang's list Is over thirty years 
old and may not accord with contemporary usage. Kennedy's list is helpful 
but the selection of characters needs to be refined. 

IV . Methodology of Present Stud y 

1 ) B asal Year of Study 

Children in the third year of primary schooling (age 9+) were 
selected for the study as by this time they will have mastered the mechan- 
ics of reading end writing and as there Is evidence that the third year 
in the primary is a critical period for these skills at the higher level. 
Moreover, results from the annual achievement testing programme in the 
basic subjects of languages and primary mathematics were available from 
this grade upwards for comparison and further analysis. 

2) Selection of Test Matarlals 

Previous studies show that by the end of six years of primary school- 
ing, children will have acquired a Chinese reading vocabulary of about 
3500 to 4000 characters. At the end of the third year of primary schooling 
children will have learnt about 1500 to 2000 character*. 

In the selection of test materials care was taken to ensure that: 
a) the characters be serviceable and within the experiential background 
of the children and b) the written vocabulary be contained In the reading 
vocabulary . 

Altogether 1851 different or token characters wete sampled. These 




were Incorporated Into thirty connected prose passages esch dealing with 
a specific topic such as "A Picnic," "Santa Claus" and each containing 
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about LOO characters. Of the corpus of characters tested recurring ones 
amounted to about 407. and for each passage abou*- 60 new or different 
characters were tested. 

3) Sampling 

The experiment was conducted from November, 1966 to May, 1967. 

A cluster sample of three to four Government primary schools was used 
for six to eight passages. The sample size ranged from 407 to 461 and 
was In excess of the 390 required for a finite population of 15000 
Government Primary III children with an assurance of an error not ex- 
ceeding L7. at the 957. confidence level. There was no significant dif- 
ference In respect of age, sex and academic performance between sample 
and population as shown by chl-squaro tests. 

4) Experimental Procedure 

a) The "language perception test" technique through the use of 
the tape recorder was employed. Each passage of thirty minutes duration 
including explanations was recorded on tape. This made possible the 
administration of the taped "tests" In a standardised situation by the 
class teacher. 

b) Before the test proper, the content of each passage was ex- 
plained fully and visual aids was used where possible. After the recorded 
explanation the passage was reproduced from the tape recorder three times 
at dictation speed. The children Listened on the first occasion, wrote 
down each character on the second occasion and reviewed their writing on 
the third. 

c) After the "dictation" the children's versions were marked 

by the teacher. Each character wts endorsed as either correct or incorrect. 
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Gradations and patterns of errors were ignored at this stage. The manual 
marking enabled teachers to feed back relevent information to the pupils. 

d) Analysis made possible the derivation of a facility index 
for each character. The facility index is expressed as the percentage 
of all participating children who wrote a character correctly. The 407. 
or so recurring characters provided a reliability estimate for these 
characters. Table I below shows the number of characters grouped by 
facility levels: 

Table I 



Facility Levels (F) No. of Characters 



90%- 100% 


292 






80X-897. 


229 






7.0%- 7.9% 


198 


total 


719 


607.-69% 


169 






5.0%- 59% _ 


183 


total 


352 


40%-497. 


160 






307.-397. 


176 






20%-29% 


184 






10%- 197. 


137 






. 0%-9% 


123 


total 


780 






total 


1851 



V. Discussion of Results 

1) The 1851 token characters are grouped under type characters in 
"dictionary" order with radicals or lexical units followed by characters 
arranged according to the complexity or numbers of strokes within each 
character. A facility Index Is given for each Character as shown In 
Appendix I. 

2) The arbitrary threshold score of 70% facility Index Is taken as 

the lower limit to Indicate tho »e characters which are reproduced correctly 
by nearly three quarters of the Primary It? children. Thus the 719 char* 
acters In Table I above should be regarded as the minimum core of written 
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vocabulary of children at this level. The value of 707. ensures the possi- 
bility of written communication among the majority; It leaves opportunity 
for further learning to attain near 1007. facility and It takes account 
of the educationally disadvantaged. 

3) There was some evidence of Zlpf's law (1949) operating In relation 
to word frequency and word length. It was found that other things 

being equal the mean number of strokes for characters of median difficulty 
was in the neighbourhood of fourteen strokes. Perhaps this is thn magic 
number seven plus or minus two for Chinese orthography. 

4) The study has raised a number of interesting problems and pointed 
to avenues of further research. One such Is thr need for basic research 
into the auditory reception of the language and the encoding of the sym- 
bols received. Another is the exploration of the cognitive aspect of 
grapheme -phoneme correspondence of the language. Chinese probably rates 
low in the requirements for good communication symbols. But In the 
balance and symmetry of Its logographlc form It probably carries more 
Information per chunk and certainly ha9 what Chao (1968) colls "elegance”. 
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